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ABSTRACT 

The capabilities of flight control systems can be en- 
hanced by designing them to emulate functions of natural 
intelligence. Intelligent control functions fall in three cat- 
egories. Declarative actions involve decision-making, 
providing models for system monitoring, goal planning, 
and system/scenario identification. Procedural actions 
concern skilled behavior and have parallels in guidance, 
navigation, and adaptation. Reflexive actions arc sponta- 
neous, inner-loop responses for control and estimation. 
Intelligent flight control systems learn knowledge of the 
aircraft and its mission and adapt to changes in the flight 
environment. Cognitive models form an efficient basis 
for integrating "outer-loop/inner-loop" control functions 
and for developing robust parallel-processing algorithms. 

INTRODUCTION 

Recounting personal experiences in confronting wind 
gusts, one of the Wright brothers wrote, "The problem of 
overcoming these disturbances by automatic means has 
engaged the attention of many ingenious minds, but to my 
brother and myself, it has seemed preferable to depend 
entirely on intelligent control" [1, 2], The Wright broth- 
ers’ piloting actions depended on proper interpretation of 
visual and inertial cues, demonstrating biological intelli- 
gent control. In the past, human pilots flew aircraft 
through manual dexterity, informed planning, and coordi- 
nation of missions. As aircraft characteristics and tech- 
nology have allowed, an increasing share of the aircraft's 
operation has come to rely on electro-mechanical sensors, 
computers, and actuators. Panel displays have enhanced 
decision-making, stability augmentation systems have im- 
proved flying qualities, and guidance logic has carried 
machine intelligence to the point of "hands-off’ flying for 
much of a modem aircraft’s mission. 
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In a contemporary context, intelligent flight control 
has come to represent even more ambitious plans to 

• make aircraft less dependent on proper human 
actions for mission completion, 

• enhance the mission capability of aircraft, 

• improve performance by learning from experience, 

• increase the reliability and safety of flight, and 

• lower the cost and weight of aircraft systems. 

This paper presents concepts for intelligent flight control 
through the aid of what were once called "artificial” de- 
vices for sensing, computation, and control. We distin- 
guish between control functions according to a cogni- 
tivc/biological hierarchy that is bounded on one end by 
declarative functions , which typically involve decision- 
making, and on the other by reflexive functions , which are 
spontaneous reactions to external or internal stimuli. 

In a classical flight control context, declarative func- 
tions are performed by the control system's outer loops , 
and reflexive functions are performed by its inner loops. 
At an intermediate level, procedural functions — like re- 
flexive functions - have well-defined input-output charac- 
teristics but of a more complicated structure. Traditional 
design principles suggest that the outer-loop functions 
should be dedicated to low-bandwidth, large-amplitude 
control commands, while the inner-loop functions should 
have high bandwidths and relatively lower-amplitude ac- 
tions. There is a logical progression from the sweeping, 
flexible alternatives associated with satisfying mission 
goals to more local concerns for stability and regulation 
about a desired path or equilibrium condition. 

FOUNDATIONS FOR INTELLIGENT 
FLIGHT CONTROL 

Intelligent flight control design draws on two appar- 
ently unrelated bodies of knowledge. The first is rooted in 
classical analyses of aircraft stability, control, and flying 
qualities. The second derives from human psychology 
and physiology. The design goal is to find new control 
structures that are consistent with the reasons for flying 
aircraft, that bring flight control systems to a higher level 
of overall capability. 
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Aircraft Flying Qualities and Flight Control 

An aircraft requires guidance, navigation, and control 
to perform its mission. As suggested by Fig. 1 , a human 
pilot can interact with the aircraft at several levels, and his 
or her function may be supplanted by electro-mechanical 
equipment. The pilot performs three distinct functions: 
sensing, regulation, and decision-making. These tasks 
exercise different human characteristics: the ability to see 
and feel, the ability to identify and correct errors between 
desired and actual states, and the ability to decide what 
needs to be done next. The first depends on the body's 
sensors and the neural networks that connect them to the 
brain. The second relies on motor functions enabled by 
the neuro-muscular system to execute learned associations 
between stimuli and desirable actions. The third requires 
more formal, introspective thought about the reasons for 
taking action, drawing on the brain's deep memory to re- 
call important procedures or data. Sensing and regulation 
are high-bandwidth tasks that allow little time for deep 
thinking.. Decision-making is a low-bandwidth task that 
requires concentration. Each of these tasks exacts a 
workload toll on the pilot. 

Pilot workload has become a critical issue as the 
complexity of systems has grown, and furnishing ideal 
flying qualities throughout the flight envelope has become 
an imperative. It is particularly desirable to reduce the 
need to perform high-bandwidth, automatic functions, 
giving the pilot time to cope with unanticipated or un- 
likely events. In the future, teleoperatcd or autonomous 
systems could find increasing use for missions that expose 
human pilots to danger. 

Research on the flying (or handling) qualities of air- 
craft has identified ways to make the pilot's job easier and 
more effective, and it provides models on which auto- 
matic systems might be based. The first flying qualities 
specification simply stated, "(the aircraft) must be steered 
in all directions without difficulty and all time (be) under 
perfect control and equilibrium" [3,4]. Further evolution 
of flying qualities criteria based on dynamic modeling and 
control theory has led to the widely used U. S. military 
specification [5] and the succeeding military standard |6). 

Flying qualities research has led to the development 
of control-theoretic models of piloting behavior. Most of 
these models have dealt with reflexive, compensatory 
tracking tasks using simple time-lag and transfer function 
models [7, 8] or linear-quadratic-Gaussian (LQG) opti- 
mal-control models [9, 10]. Some treatments go into con- 
siderable detail about neuro-muscular system dynamics 
[11, 12]. These models often show good correlation with 
experimental results, not only in compensatory tracking 
but in more procedural tasks: the progression of piloting 
actions from single- to multi-input strategics as the com- 
plexity of the task increases is predicted in [1()|, while 
test-pilot opinion ratings are predicted by a "Paper Pilot" 
in [13]. These results imply that computer-based control 
laws can perform procedural and reflexive tasks within 
the fit error of mathematical human-pilot models. Insight 
on the human pilot's declarative actions can be drawn 


from [14-16], which introduce the types of decisions that 
must be made in aerospace scenarios, as well as likely 
formats for pilot-vehicle interface. 



Figure 1. Guidance, Navigation, and Control Structure. 

Figure 1 also portrays a hierarchical structure for 
stability-augmentation, command-augmentation, autopi- 
lot, and flight-management-system functions that can be 
broken into reflexive and declarative parts. Stability 
augmentation is reflexive control provided by the inner- 
most loop, typically implemented as a linear feedback 
control law that provides stability and improves transient 
response through an Estimation/Compensation block. 
Forward-loop control provides the shaping of inputs for 
satisfactory command response through a Con- 
trol/Compensation block, again employing linear models. 
The combination of control and estimation can be used to 
change the Hying qualities perceived by die pilot, or it can 
provide a decoupled system for simplified guidance 
commands [17-20]. A basic autopilot merely translates 
the human pilot's commands to guidance commands for 
constant heading angle, bank angle, or airspeed, while the 
Guidance block can be expanded to include declarative 
[light management functions, using inputs from Naviga- 
tion sensors and algorithms. 

Intelligent functions have been added to flight control 
systems in the past. Gain scheduling and switching im- 
prove performance in differing flight regimes and mission 
phases. Control theory, heuristics, and reduced-order op- 
timization have been used to achieve near-optimal trajec- 
tory management in many flight phases (e.g., [21-23]). 
The Guidance, Navigation, and Control (GNC) Systems 
for Project Apollo's Command/Service and Lunar Mod- 
ules provide an early example of intelligent aerospace 
control [24-26]. The state-of-the-art of aircraft flight 
control systems has progressed to comparable levels and 
beyond, as represented by systems installed in modem 
transport and fighter aircraft (e.g., [27, 28]). 

Intelligent flight control 1 can be justified only if it 
materially improves the functions of aircraft, if it saves 


1 As used here ’’intelligent flight control" subsumes "intelligent 
guidance, navigation, and control.” 
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the time and/or money required to complete a mission, or 
if it improves the safety and reliability of the system. In- 
teresting philosophical problems can be posed. Must ma- 
chine-intelligence be better than the human intelligence it 
replaces in order for it to be adopted? We are willing to 
accept the fact that humans make mistakes; if a machine 
has a similar likelihood of making a mistake, should it be 
used? Lacking firm knowledge of a situation, humans 
sometimes gamble; should intelligent machines be al- 
lowed to gamble? When is it acceptable for machine in- 
telligence to be wrong (e.g., during learning)? Must the 
machine solution be "optimal," or is "feasible" good 
enough? Which decisions can the machine make without 
human supervision, and which require human interven- 
tion? In a related vein, how much information should be 
displayed to the human operator? Should intelligent flight 
control ever be fully autonomous? If the control system 
adapts, how quickly must it adapt? Must learning occur 
on-line, or can it be delayed until a mission is completed? 
All of these questions must be answered in every potential 
application of intelligent control. 

Cognitive and Biological Paradigms for In- 
telligence 

Intelligence is the "ability involved in calculating, 
reasoning, perceiving relationships and analogies, learning 
quickly, storing and retrieving information .... classifying, 
generalizing, and adjusting to new situations" [29]. This 
definition does not deal with the mechanisms by which 
intelligence is realized, and it makes the tacit assumption 
that intelligence is a human trait. Intelligence relates not 
only to intellectuality and cognition but to personality and 
the environment [30]. 

The debate over whcthcr-or-not computers ever will 
"think” may never be resolved, though this need not re- 
strict our working models for computer-based intelligent 
control. One argument against the proposition is that 
computers deal with syntax (form), while minds deal with 
semantics (meaning), and syntax alone cannot produce 
semantics [31]. This does not limit the ability of a com- 
puter to mimic natural intelligence in a limited domain. 
Another contention is that thinking is "non-algorilhmic,” 
that the brain evokes consciousness through a process of 
natural selection and inheritance [32]. Consciousness is 
required for common sense, judgment of truth, under- 
standing, and artistic appraisal, concepts that arc not for- 
mal and cannot readily be programmed for a computer 
(i.e., they are not syntactic). 

Conversely, functions that are automatic or "mind- 
less" (i.e., that are unconscious), could be programmed, 
implying that computers have more in common with "un- 
intelligent" functions. Gtidel's Theorem 2 * is offered in 


2 As summarized in [32]: Any algorithm used to establish a 

mathematical truth cannot prove the propositions on which it is 

based. Or another [33]: Logical systems have to be fixed up "by 
calling the undecidable statements axioms and thereby declaring 
them to be true," causing new undecidable statements to crop up. 


[33] as an example of an accepted proposition that may be 
considered non-algorithmic; the statement and proof of 
the theorem must themselves be non-algorithmic and, 
therefore, not computable. However, while the human 
curiosity, intuition, and creativity that led to GOdel’s 
Theorem may not be replicable in a computer, the state- 
ment and proof are expressed in a formal way, so they 
might be considered algorithmic after all. 

The notion that syntax alone cannot produce seman- 
tics is attacked as being an axiom that is perhaps true but 
not knowable in any practical sense [34]; therefore, the 
possibility that a computer can "think" is not ruled out. A 
further defense is offered in [35], which suggests that 
human inference may be based, in part, on inconsistent 
axioms. This could lead to rule-based decisions that are 
not logically consistent, that are affected by heuristic bi- 
ases or sensitivities, that may reflect deeper wisdom, or 
that may be wrong or contradictory. For example, knowl- 
edge and belief may be indistinguishable in conscious 
thought; however, one implies truth and the other bias or 
uncertainty. One might also postulate the use of meta-rule 
bases that govern apparently non-algorithmic behavior. 
The process of searching a data base, though bound by 
explicit symbolic or numerical algorithms, may include 
randomized behavior (e.g., genetic algorithms) that are 
not immediately identifiable as algorithmic. 

More to our point, it is likely that a computer capable 
of passing a flying-qualities/pilol-workload/control-lheo- 
rctic equivalent of the Turing test^ [36] could be built 
even though that computer might not understand what it is 
doing 4 . For intelligent flight control, the principal objec- 
tive is improved control performance, that is, for im- 
proved input-output behavior. The computer can achieve 
the operative equivalent of consciousness on its own 
terms and in a limited domain, even if it does not possess 
emotions or other human traits. 

Discussions of human consciousness naturally fall 
into using the terminology of computer science. It is con- 
venient — as well as consistent with empirical data - to 
identify four types of thought: conscious, preconscious, 
subconscious, and unconscious [37]. Conscious thought 
is the thought that occupies our attention, that requires fo- 
cus, awareness, reflection, and perhaps some rehearsal. 
Conscious thought performs declarative processing of the 
individual’s knowledge or beliefs. It makes language, 
emotion, artistry, and philosophy possible. Unconscious 
thought "describes those products of the perceptual sys- 
tem that go unattended or unrehearsed, and those memo- 
ries that are lost from primary memory through display or 
displacement" [37]. Within the unconscious, we may 


2 Turing suggested dial a computer could be considered "intelli- 
gent" if it could "converse" with a human in a manner that is in- 
distinguishable from a human conversing with a human. 

4 ScJirlc describes such a computer as a "Chinese Room" that 
translates Chinese characters correctly by following rules while 
not understanding the language in [31]. 
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further identify two important components. Subconscious 
thought is procedural knowledge that is below our level of 
awareness but central to the implementation of intelligent 
behavior. It facilitates communication with the outside 
world and with other parts of the body, providing the 
principal home for the learned skills of art, athletics, con- 
trol of objects, and craft. We are aware of perceptions if 
they are brought to consciousness, but they also may lake 
a subliminal (subconscious) path to memory. Precon- 
scious thought is pre-attentive declarative processing that 
helps choose the objects of our conscious thought, operat- 
ing on larger chunks of information or at a more symbolic 
level. It forms a channel to long-term and implicit mem- 
ory, and it may play a role in judgment and intuition. 

Whether we adopt a single-processor model of con- 
sciousness such as Adaptive Control of Thought (ACT* 
as in [38]) or a connectionist model like Parallel Dis- 
tributed Processing (PDP from [39]), we arc led to believe 
that the central nervous system supports a hierarchy of in- 
telligent and automatic functions with declarative actions 
at the top, procedural actions in the middle, and reflexive 
actions at the bottom. Declarative thinking occurs in the 
brain's cerebral cortex, which accesses the interior limbic 
system for long-term memory [40]. Together, they pro- 
vide the processing unit for conscious thought. Regions 
of the cerebral cortex are associated with different intel- 
lectual and physical functions; the distinction between 
conscious and preconscious function may depend on the 
activation level and duration in regions of the cerebral 
cortex. 

The working memory of conscious thought has ac- 
cess to the spinal cord through other brain parts that are 
capable of taking procedural action (e.g., the brain stem 
for autonomic functions, the occipital lobes for vision, and 
the cerebellum for movement). Procedural action can be 
associated with subconscious thought, which supports vol- 
untary automatic processes like movement and sensing. 
Voluntary signals are sent over the somatic nervous sys- 
tem, transmitting to muscles through the motor neural 
system and from receptors through the sensory neural sys- 
tem. 

The spinal cord itself "closes the control loop" for re- 
flexive actions long before signals could be processed by 
the brain. Nevertheless, these signals are available to the 
brain for procedural and declarative processing. We are 
all aware of performing some task (e.g., skating or riding 
a bicycle) without effort, only to waver when wc focus on 
what we are doing. Involuntary regulation of the body’s 
organs is provided by the autonomic nervous system, 
which is subject to unconscious processing by the brain 
stem. "Bio-feedback" can be learned, allowing a modest 
degree of higher-level control over some autonomic func- 
tions. 

Declarative, procedural, and reflexive functions can 
be built into a model of intelligent control behavior (Fig. 
2). The Conscious Thought module governs the system 
by performing declarative functions, receiving informa- 


tion and transmitting commands through the Subconscious 
Thought module, which is itself capable of performing 
procedural actions. Conscious Thought is primed by Pre- 
conscious Thought [41], which can perform symbolic 
declarative functions and is alerted to pending tasks by 
Subconscious Thought. These three modules overlie a 
bed of deeper Unconscious Thought that contains long- 
term memory. They are capable of intellectual learning, 
and while their physical manifestation may be like the 
PDP model, they exhibit characteristics that are most 
readily expressed by the ACT* model 5 . 

The Subconscious Thought module receives informa- 
tion from the Sensory System and conveys commands to 
the Muscular System through peripheral networks. Volun- 
tary Reflexive Actions provide low-level regulation in par- 
allel with the high-level functions, responding to critical 
stimuli and coordinating control actions. High- and low- 
level commands may act in concert, or one may block the 
other. Voluntary Reflexive Actions can be trained by 
high-level directives from Subconscious Thought, while 
the learning capabilities of involuntary Reflexive Action 
are less clear. Control actions produce Body motion and 
can affect an external Controlled System , as in piloting an 
aircraft. In learned control functions. Body motion helps 
internalize the mental model of Controlled System behav- 
ior. The Body and the Controlled System are both di- 
rectly or indirectly subjected to Disturbances; ; for exam- 
ple, turbulence would affect an aircraft directly and the pi- 
lot indirectly. The Sensory System observes External 
Events as well as Body and Controlled System motions, 
and it is subject to Measurement Errors . 

There arc many parallels and analogies to be drawn in 
comparing the functions of human and computer-based 
intelligence. It may be useful to ponder a few, especially 
those related to knowledge acquisition, natural behavior, 
aging, and control. Perhaps the most important observa- 
tion is that learning requires error or incompleteness. 
There is nothing new to be gained by observing a process 
that is operating perfectly. In a control context, any oper- 
ation should be started using the best available knowledge 
of the process and the most complete control resources. 
Consequently, learning is not always necessary or even 
desirable in a flight control system. Biological adaptation 
is a slow process , and proper changes in behavior can be 
made only if there is prior knowledge of alternatives. If 
adaptation occurs too quickly, there is the danger that 
misperceptions or disturbance effects will be misinter- 
preted as parametric effects. Rest is an essential feature 
of intelligent biological systems. It has been conjectured 
that REM (rapid-eye-movement) Sleep is a time of learn- 


5 ... although the actual processing mechanism is not clear. In a 
recent seminar at Princeton (March 9, 1992), Herbert Simon 
noted that if you open the cabinet containing a sequential-pro- 
cessing computer, the innards look very much like those of a 
parallel processor. 
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ing, consolidating, and pruning knowledge 6 [42]. Sys- 
tems can learn even when they are not functioning by re- 
viewing past performance, perhaps in a repetitive or 
episodic way. 




Controlled 



System 



Events 


Figure 2. A Model of Cognitive/Biological Control Be- 
havior. 


The cells of biological systems undergo a continuing 
birth-life-death process , with new cells replacing old; na- 
ture provides a means of transmitting genetic codes from 
cell to cell. Nevertheless, the central nervous system is 
incapable of functional regeneration. Once a portion of 
the system has been damaged, it cannot be replaced, al- 
though redundant neural circuitry can work around some 
injuries. Short-term memory often recedes into long-term 
memory , where it generally takes longer to be retrieved. 
With time, items in memory that are less important are 
forgotten, possibly replaced by more important informa- 
tion ; hence, information has a half-life that depends upon 
its significance to our lives (and perhaps to its "refresh 
rate”). Humans develop the capability to form chords of 
actions that are orchestrated or coordinated to achieve a 
single goal. Response to an automotive emergency may 
include applying the brakes, disengaging the clutch, steer- 
ing to avoid an obstacle, and bracing for impact all at 
once. We develop ^knee-jerk'* reactions that combine 


6 "In REM Sleep, the brain is barraged by signals from the brain 
stem. Impulses fired to the visual cortex produce images that 
may contain materials from the day's experiences, unsolved 
problems, and unfinished business." [42] 


declarative, procedural, and reflexive functions, like clap- 
ping after the last movement of a symphony. 

Nature also provides structural paradigms for control 
that arc worth emulating in machines. First, there is a 
richness of sensory information that is hard to fathom, 
with millions of sensors providing information. This re- 
sults in high signal-to-noise ratio in some cases, and it al- 
lows symbolic/image processing in others. Those signals 
requiring high-bandwidth, high-resolution channel capac- 
ity (vision, sound, and balance) have short, dedicated, 
parallel runs from the sensors to the brain. This enhances 
the security of the channels and protects the signals from 
noise contamination. Dissimilar but related sensory in- 
puts facilitate interpretation of data. A single motion can 
be sensed by the eyes, by the inner ear, and by the "seat- 
of-lhe-pants" (i.e., by sensing forces on the body itselQ, 
corroborating each other and suggesting appropriate ac- 
tions. When these signals arc made to disagree in mov- 
ing-cockpit simulation of flight, a pilot may experience a 
sense of confusion and disorientation. 

There arc hierarchical and redundant structures 
throughout the body. The nervous system is a prime ex- 
ample, bringing inputs from myriad sensors (both similar 
and dissimilar) to the brain, and performing low-level rea- 
soning as an adjunct. Many sensing organs occur in pairs 
(c.g., eyes, cars, inner cars), and their internal structures 
are highly parallel. Pairing allows graceful degradation 
in the event that an organ is lost. Stereo vision vanishes 
with the loss of an eye, but the remaining eye can provide 
both foveal and peripheral vision, as well as a degree of 
depth perception through object size and stadiamctric pro- 
cessing. Our control effectors (arms, hands, legs, feet) 
also occur in pairs, and there is an element of "Fail- 
Op/Fail-Op/Fail-Safe” design [43] in the number of fin- 
gers provided for manual dexterity. 

Structure for Intelligent Flight Control 

The preceding section leads to a control system 
structure that overlays the cognitive/biological model of 
Fig. 2 on the flight control block diagram of Fig. 1 and 
adds new functions. The suggested structure (Fig. 3) has 
super-blocks identifying declarative, procedural, and re- 
flexive functions; these contain the classical GNC func- 
tions plus new functions related to decision-making, pre- 
diction, and learning. The black arrows denote informa- 
tion [low for the primary GNC functions, while the gray 
arrows illustrate the data flow that supports subsidiary ad- 
justment of goals, rules, and laws. 

Within the super-blocks, higher-level functions are 
identified as conscious, preconscious, and subconscious 
attributes, not with disregard for the philosophical objec- 
tions raised earlier but as a working analog for establish- 
ing a computational hierarchy. The new functions relate 
to setting or revising goals for the aircraft’s mission, moni- 
toring and adjusting the aircraft’s systems and subsystems, 
identifying changing characteristics of the aircraft and its 
environment, and applying this knowledge to modify the 
structures and parameters of GNC functions. 
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The suggested structure has implications for both 
hardware and software. Declarative functions arc most 
readily identified with single-processor computers pro- 
grammed in LISP or Prolog, as decision-making is associ- 
ated with list processing and the statement of logical rela- 
tionships. Procedural functions can be conceptual i/cd as 
vector or "pipelined" processes programmed in FOR- 
TRAN, Pascal, or C, languages that have been developed 
for numerical computation with subroutines, arrays, dif- 
ferential equations, and recursions. Reflexive functions 
seem best modeled as highly parallel processes imple- 
mented by neural networks, which apply dense mappings 
to large masses of data almost instantaneously. Neverthe- 
less, parallel processes can be implemented using sequen- 
tial processors, and sequential algorithms can be "paral- 
lelized" for execution on parallel processors. The choice 
of hardware and software depends as much on the current 
state-of-the-art as on the closeness of computational re- 
quirements and GNC functions. 

In the remainder of the paper, declarative, procedural, 
and reflexive control functions are discussed from an 
aerospace perspective. In practice, the boundaries be- 
tween mission tasks may not be well defined, and there is 
overlap in the kinds of algorithms that might be applied 
within each group. A number of practical issues related to 
human factors, system management, certifiabilily, main- 
tenance, and logistics are critical to the success! ul imple- 
mentation of intelligent flight control, but they arc not 
treated here. 


DECLARATIVE SYSTEMS 

Goal planning, system monitoring, and control-mode 
switching arc declarative functions that require reasoning. 
Alternatives must be evaluated, and decisions must be 
made through a process of deduction , that is, by inferring 
answers from general or domain-specific principles. The 
inverse process of learning principles from examples is 
induction , and not all declarative systems have this ca- 
pability. Most declarative systems have fixed structure 
and parameters, with knowledge induced off-line and be- 
fore application; declarative systems that learn on-line 
must possess a higher level of reasoning ability, perhaps 
through an internal declarative module that specializes in 
training. 

Expert Systems 

Expert Systems are computer programs that use phys- 
ical or heuristic relationships and facts for interpretation, 
diagnosis, monitoring, prediction, planning, and design. 
In principal, an expert system replicates the decision-mak- 
ing process of one or more experts who understand the 
causal or structural nature of the problem [44]. While 
human experts may employ "nonmonotonic reasoning" 
and "common sense" to deduce facts that apparently defy 
simple logic, computational expert systems typically are 
formal and consistent, basing their conclusions on analo- 
gous cases or well-defined rules 7 . 


7 Expert systems can have tree or graph structures. In the for- 
mer, there is a single root node, and all final ( leaf) nodes are 
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A rule-based expert system consists of data , rules , 
and an inference engine [46]. It generates actions predi- 
cated on its data base, which contains measurements as 
well as stored data or operator inputs. An expert system 
performs deduction using knowledge and beliefs ex- 
pressed as parameters and rules. Parameters have values 
that either are external to the expert system or arc set by 
rules. An "IF-THEN” rule evaluates a premise by testing 
values of one or more parameters related by logical 
"ANDs” or ”ORs," as appropriate, and it specifics an ac- 
tion that set values of one or more parameters. 

The rule base contains all the cause-and-cffcct rela- 
tionships of the expert system, and the inference engine 
performs its function by searching the rule base. Given a 
set of premises (evidence of the current state), the logical 
outcome of these premises is found by a data-driven 
search (forward chaining) through the rules. Given a de- 
sired or unknown parameter value, the premises needed to 
support the fixed or free value arc identified by a goal-di- 
rected search ( backward chaining) through the rules. 
Querying (or firing) a rule when searching in either direc- 
tion may invoke procedures that produce parameter values 
through side effects [47], 

Rules and parameters can be represented as objects or 
frames using ordered lists that identify names and at- 
tributes. Specific rules and parameters arc represented by 
lists in which values are given to the names and attributes. 
The attribute lists contain not only values and logic but 
additional information for the inference engine. This in- 
formation can be used to compile parameter -rule-associa- 
tion lists that speed execution [48]. Frames provide useful 
parameter structures for related productions, such as ana- 
lyzing the origin of one or more failures in a complex, 
connected system [49]. Frames possess an inheritance 
property ; thus a particular object lays claim to the proper- 
ties of the object type. 

Crew/Team Paradigms for Declarative Flight 
Control 

Logical task-classification is a key factor in the de- 
velopment of rule-based systems. To this point, we have 
focused on the intelligence of an individual as a paradigm 
for control system design, but it is useful to consider the 
hypothetical actions of a multi-person aircraft crew as 
well. In the process, we develop an expert system of ex- 
pert systems, a hierarchical structure that reasons and 
communicates like a team of cooperating, well-trained 
people might. This notion is expanded in |50-53]. The 
Pilot's Associate Program initially focused on a four-task 
structure and evolved in the direction of the multiple 
crew-member paradigm [54-56]. 

AUTOCREW is an ensemble of nine cooperating 
rule-based systems, each figuratively emulating a member 


connected to their own single branch . In the latter, one or more 
branches lead to individual nodes. Reasoning is consistent if an 
individual node is not assigned differing values by different 
branches [45]. 


of a World War II bomber crew: executive (pilot), co-pi- 
lot, navigator, flight engineer, communicator, spoofer 
(countermeasures), observer, attacker, and defender (Fig. 
4) [53]. The executive coordinates mission-specific tasks 
and has knowledge of the mission plan. The aircraft's 
human pilot can monitor AUTOCREW functions, follow 
its suggestions, enter queries, and assume full control if 
confidence is lost in the automated solution. The overall 
goal is to reduce the pilot's need to regulate the system 
directly without removing discretionary options. AU- 
TOCREW contains over 500 parameters and over 400 
rules. 
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Figure 4. AUTOCREW Configuration with Pilot/Aircraft 
Interface (adapted from [52]). 

AUTOCREW was developed by defining each mem- 
ber expert system as a knowledge base y according to the 
following principles: 

• Divide each knowledge base into task groups: time- 
critical, routine, and mission-specific. 

• Order task groups from most to least time-critical 
to quicken Lhe inference engine’s search. 

• Break major tasks into sub-tasks according to need 
for communicating system functions. 

• Identify areas of cooperation between knowledge 
bases. 

The five main task groups for each crew member are: 
tasks executed during attack on the aircraft, tasks executed 
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during emergency or potential threat, tasks ordered by the 
EXECUTIVE, tasks executed on a routine basis, and mis- 
sion-specific tasks. Routine and mission-specific tasks 
are executed on each cycle; emergency tasks arc executed 
only when the situation warrants. Operation of AU- 
TOCREW was simulated to obtain comparative expert- 
system workloads for two mission scenarios: inbound 
surface-to-air missile attack and human pilot incapacita- 
tion [52]. In addition to the overall AUTOCREW system, 
a functioning NAVIGATOR sensor-management expert 
system was developed. Knowledge acquisition for the 
system is challenging because traditional methods (c.g., 
domain-expert interviews) do not provide sufficiently de- 
tailed information to design the system [57]. 

Additional perspectives on intelligent flight manage- 
ment functions can be obtained from the literature on de- 
cision-making by teams, as in [58-60], Alternate ap- 
proaches to aiding the pilot in emergencies arc given in 
[61,62]. 

Reasoning Under Uncertainty 

Rule-based control systems must make decisions un- 
der uncertainty. Measurements are noisy, physical sys- 
tems are subject to random disturbances, and the envi- 
ronment within which decisions must be made is ambigu- 
ous. For procedural systems, the formalism of optimal 
state estimation provides a rigorous and useful means of 
handling uncertainty [63]. For declarative systems, there 
are a number of methods of uncertainty management, in- 
cluding probability theory, Dempstcr-Shafcr theory, pos- 
sibility theory (fuzzy logic), certainty factors, and the the- 
ory of endorsements [64]. 

Bayesian belief networks [65], which propagate event 
probabilities up and down a causal irce using Bayes's rule, 
have particular appeal for intelligent control applications 
because they deal with probabilities, which form the basis 
for stochastic optimal control. We have applied Bayesian 
networks to aiding a pilot who may be flying in the vicin- 
ity of hazardous wind shear [66]. Figure 5 shows a net- 
work of the causal relationships among meteorological 
phenomena associated with microbursi wind shear, as 
well as temporal and spatial information that could affect 
the likelihood of microburst activity. A probability of oc- 
currence is associated with each node, and a conditional 
probability based on empirical data is assigned to each ar- 
row. The probability of encountering microburst wind 
shear is the principal concern; however, each time new 
evidence of a particular phenomenon is obtained, proba- 
bilities are updated throughout the entire tree. In the pro- 
cess, the estimated likelihood of actually encountering the 
hazardous wind condition on the plane’s flight path is re- 
fined. 

The safety of aircraft operations near microbursi wind 
shear will be materially improved by forward-looking 
Doppler radar, which can sense variations in the wind 
speed. Procedural functions that can improve the reliabil- 
ity of the wind shear expert system include extended 


Kalman filtering of Lhe velocity measurements at incre- 
mental ranges ahead of the aircraft [67]. 



Figure 5. Bayesian Belief Network to Aid Wind Shear 
Avoidance (adapted from [67]). 

Probabilistic reasoning of a different sort has been 
applied to a problem in automotive guidance that may 
have application in future Intelligent Vchiclc/Highway 
Systems [68-70J. Intelligent guidance for headway and 
lane control on a highway with surrounding traffic is 
based on worst-plausible-case decision-making. It is as- 
sumed that the intelligent automobile (IA) has imaging 
capability as well as on-board motion sensors; hence, it 
can deduce the speed and position of neighboring auto- 
mobiles. Each automobile is modeled as a simple dis- 
crete-time dynamic system, and estimates of vehicle states 
arc propagated using extended Kalman filters [63]. There 
arc limits on the performance capabilities of all vehicles, 
and IA strategy is developed using time-to-collide, brak- 
ing ratios, driver aggressiveness, and desired security fac- 
tors. Plausible guidance commands are formulated by 
minimizing a cost function based on these factors [70]. 
Both normal and emergency expert systems govern the 
process, supported by procedural calculations for situation 
assessment, traffic prediction, estimation, and control 
(Fig. 6). Guidance commands arc formulated by mini- 
mizing a cost function based on these factors [70]. 

Each of the expert systems discussed in this section 
performs deduction in a cyclical fashion based on prior 
logical structures, prior knowledge of parameters, and 
real-time measurements. Intelligent flight control systems 
must deal with unanticipated events, but it is difficult to 
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identify aeronautical applications where on-line declara- 
tive learning is desirable. Nevertheless, off-line induction 
is needed to formulate the initial declarative system and 
perhaps (in a manner reminiscent of REM Sleep) to up- 
grade declarative logic between missions. 



Figure 6. Intelligent Guidance for Automotive Headway 
and Lane Control [69]. 

Inducing Knowledge in Declarative Systems 

In common usage, "learning" may refer a) to collect- 
ing inputs and deducing outputs and b) to inducing the 
logic that relates inputs and outputs to specific tasks. 
Here, we view the first process as the normal function of 
the intelligent system and the second as "learning." 
Teaching an expert system the rules and parameters that 
generalize the decision-making process from specific 
knowledge is the inverse of expert-system operation. 
Given all possible values of the parameters, what arc the 
rules that connect them? Perhaps the most common an- 
swer is to interview experts in an attempt to capture the 
logic that they use, or failing that, to study the problem 
intensely so that one becomes expert enough to identify 
naturally intelligent solutions. These approaches can be 
formalized [71, 72], and they were the ones used in [67] 
and [68]. Overviews of alternatives for induction can be 
found in [45,46,73,74]. 

Two approaches are considered in greater detail. The 
first is called rule recruitment [75], and it involves the 
manipulation of "dormant rules" (or rule templates ). This 
method was applied in the development of an intelligent 
failure-tolerant control system for a helicopter. Each 
template possesses a fixed premise-action structure and 
refers to parameters through pointers. Rules are con- 
structed and incorporated in the rule base by defining 
links and modifying parameter-rule-association lists. 
Learning is based on Monte Carlo simulations of the con- 


trolled system with alternate failure scenarios. Learned 
parameter values then can be defined as "fuzzy functions" 
[76] contained in rule premises. 

The second approach is to construct classification or 
decision trees that relate attributes in the data to decision 
classes [52]. The problem is to develop an Expert Navi- 
gation-Sensor Management System (NSM) that selects the 
best navigation aids from available measurements. Sev- 
eral aircraft paths were simulated, and the corresponding 
measurements that would have been made by GPS, Loran, 
Tacan, VOR, DME, Doppler radar, air data, and inertial 
sensors were calculated, with representative noise added. 
The simulated measurements were processed by extended 
Kalman fillers to obtain optimal state estimates in 200 
simulations. Using the root-sum-square error as a deci- 
sion metric. Analysis of Variance (ANOVA) identifies the 
factors that make statistically significant contributions to 
the decision metric, and the Iterative Dichotomizer #3 
(1D3) algorithm [77-79] extracts rules from the training 
set by inductive inference. The ID3 algorithm quantifies 
the entropy content of each attribute, that is, the informa- 
tion gained by testing the attribute at a given decision 
node. It uses an information-theoretic measure to find a 
splitting strategy that minimizes the number of nodes re- 
quired to characterize the tree. Over 900 examples were 
used to develop the NSM decision tree. 

PROCEDURAL SYSTEMS 

Most guidance, navigation, and control systems 
fielded to date are procedural systems using sequential al- 
gorithms and processors. Although optimality of a cost 
function is not always a necessary or even sufficient con- 
dition for a "good" system, linear-optimal stochastic con- 
trollers provide a good generic structure for discussion. 

Control and Estimation 

We assume that a nominal (desired) flight path is 
generated by higher-level intelligence, such as the human 
pilot or declarative machine logic. The procedural system 
must follow the path, x*(f) in t 0 < t < tf. Control is exer- 
cised by a digital computer at time intervals of At. The n- 
dimensional state vector perturbation at time tfc is xfc, and 
the /n-dimcnsional control vector perturbation is ufc. The 
discrete-time linear-quadratic-Gaussian (LQG) control 
law is formed as [63], 

Uk = u*k - Celxk - X*ld = CFy*k - Cexk (1 

y*k is the desired value of an output vector (defined as 

Rx x k + Hu u k)» an d *k is the Kalman filter estimate, ex- 
pressed in two steps: 

xk(-) = ^k-i(+) + r uk -i 

*k = xk(+) = xk(-) + K[*k - Hobs*k(-)l (2 
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The forward and feedback control gain matrices are Cp 
and Cb^ and T are state-transition and control -effect 
matrices that describe the aircrafts assumed dynamics, the 
estimator gain matrix is K, and the measurement vector, 
zfc, is a transformation of the state through Mohs- The 
gains Cb and K result from solving two Riccati equations 
that introduce tradeoffs between control use and slate per- 
turbation and between the strengths of random distur- 
bances and measurement error. Cf, which provides 
proper steady-state command response, is an algebraic 
function of Cb, 4>, r, and H 0 b s . All of the matrices may 
vary in time, and it may be necessary to compute K on- 
line. In the remainder, it is not essential that Cb and K be 
optimal (i.e., they may have been derived from eigen- 
structure assignment, loop shaping, etc.), although the 
LQR gains guarantee useful properties of the nominal 
closed-loop system [63]. 

The control structure provided by cq. 1 and 2 is quite 
flexible. It can represent a scalar feedback loop if z con- 
tains one measurement and u one control, or it can address 
measurement and control redundancy with z and u dimen- 
sions that exceed the dimension of the stale x. It also is 
possible to incorporate reduced-order modeling in the es- 
timator. Assuming that O and T have the same dimen- 
sions as the aircraft’s dynamic model (n x n and n x m ), 
the baseline estimator introduces n^-order compensation 
in the feedback control loop. The weights of the quadratic 
control cost function can be chosen not only to penalize 
state and control perturbations but to produce output 
weighting t state-rate weighting y and implicit model fol- 
lowing, all without modifying the dynamic model |63|. 
Integral compensationy low-pass filler compensation , and 
explicit model following can be obtained by augmenting 
the system model during the design process, increasing 
the compensation order and producing the control struc- 
tures shown in Fig. 7. 

These cost weighting and compensation features can 
be used together, as in the proportional-integral/implicit- 
modei-following controllers developed in [80], Implicit 
model following is especially valuable when an ideal 
model can be specified (as identified in flying qualities 
specifications and standards [5, 6]), and integral compen- 
sation provides automatic "trimming" (control that synthe- 
sizes u*^ corresponding to x*k to achieve zero steady- 
state command error) and low-frequency robustness. 
Combining integral and filter compensation produces con- 
trollers with good command tracking performance and 
smooth control actions, as demonstrated in flight tests 
[81-83]. The LQG regulator naturally introduces an in- 
ternal model of the controlled plant , a feature that facili- 
tates control design [84]. It produces a stable approxima- 
tion to the system inverse , which is at the heart of achiev- 
ing desired command tracking. 

The estimator in the feedback loop presents an effi- 
cient means of dealing with uncertainty in the measure- 
ments, in the disturbance inputs, and (to a degree) in the 
aircraft's dynamic model. If measurements arc very noisy, 


the estimator gain matrix K is "small,” so that the filter re- 
lies on extrapolation of the system model to estimate the 
stale. If disturbances are large, the state itself is more un- 
certain, and K is "large," pulling more emphasis on the 
measurements. Effects of uncertain parameters can be 
approximated as "process noise" that increases the impor- 
tance of measurements, leading to a higher K. If the sys- 
tem uncertainties are constant but unknown biases or scale 
factors, a better approach is to augment the filter state to 
estimate these terms directly. Parametric uncertainty in- 
troduces nonlinearity; hence, an extended Kalman filter 
must be used [63]. 



a) Lincar-Quadratic-Gaussian (LQG) Regulator. 



b) Proportional-Filter LQG Regulator. 



c) Proportional-Integral LQG Regulator. 



d) Explicit-Modcl-Following LQG Regulator. 


Figure 7. Structured Linear-Quadratic-Gaussian Regula- 
tors. 

Stability and Performance Robustness 

Controlled system robustness is the ability to main- 
tain satisfactory stability and performance in the presence 
of parametric or structural uncertainties in either the air- 
craft or its control system. All controlled systems must 
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possess some degree of robustness against operational 
parameter variations. The inherent stability margins of 
certain algebraic control laws (c.g., the linear-quadratic 
(LQ) regulator [63, 85-87]) may become vanishingly 
small when dynamic compensation (e.g., the estimator in 
a linear-quadratic-Gaussian (LQG) regulator) is added 
[88]. Restoring the robustness to that of the LQ regulator 
typically requires increasing estimator gains (within prac- 
tical limits) using the loop-transfer-recovery method [89) 
or the stochastic robustness approach described below. 

Subjective judgments must be made in assessing the 
need for robustness and in establishing corresponding 
control system design criteria, as there is an inevitable 
tradeoff between robustness and nominal system perfor- 
mance [90]. The designer must know the normal operat- 
ing ranges and distributions of parameter variations, as 
well as the specifications for system operability with 
failed components, else the final design may afford too lit- 
tle robustness for possible parameter variations or too 
much robustness for satisfactory nominal performance. 
Robustness traditionally has been assessed deterministi- 
cally [91, 92]; gain and phase margins arc an inherent part 
of the classical design of single-input/single-output sys- 
tems, and there are multi-input/multi-output equivalents 
based on singular-value analysis (c.g., [93]). A critical 
difficulty in applying these techniques is relating singular- 
value bounds on return-difference and invcrsc-rctum-dil- 
ference matrices to real parameter variations in the sys- 
tem. 

Statistical measures of robustness can use knowledge 
of potential variations in real parameters. The probability 
of instability was introduced in [94] and is further de- 
scribed in [95, 96]. The stochastic robustness of a linear, 
time-invariant system, is judged using Monte Carlo simu- 
lation to estimate the probability distributions of closed- 
loop eigenvalues, given the statistics of the variable pa- 
rameters in the system's dynamic model. Because the sys- 
tem is either stable or not, the probability of instability has 
a binomial distribution ; hence, the confidence intervals 
associated with estimating the metric from simulation arc 
independent of the number or nature of the uncertain pa- 
rameters [95]. 

Considerations of performance robustness arc easily 
taken into account in Stochastic Robustness Analysis 
(SRA). Systems designed using a variety of robust con- 
trol methods (loop transfer recovery, H**, optimization, 
structured covariance, and game theory) arc analyzed in 
[97], with attention directed to the probability of instabil- 
ity, probability of settling-time exceedence, probability of 
excess control usage, and tradeoffs between them. The 
analysis uncovers a wide range of system responses and 
graphically illustrates that gain and phase margins arc not 
good indicators of the probability of instability^. This 
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Real parameter variations affect not only the magnitude and 
relative phase angle of the system s Nyquist contour but its 
shape as well [63]. Therefore, the points along the contour that 


also raises doubts about the utility of singular values, as 
they arc multivariable equivalents of the classical robust- 
ness metrics. Incorporating SRA into the design of an 
LQG regulator with implicit model following and filter 
compensation leads to designs that have high levels of 
stability and performance robustness [98]. The reason for 
improvement is that SRA measures the actual effects of 
parameter variations on stability and performance rather 
than incremental changes in the nominal margins. 

Adaptation and Tolerance to Failures 

Adaptation always has been a critical element of sta- 
bility augmentation. Most aircraft requiring improved sta- 
bility undergo large variations in dynamic characteristics 
on a typical mission. Gain scheduling and control inter- 
connects initially were implemented mechanically, pneu- 
matically, and hydraulically; now the intelligent part is 
done within a computer, and there is increased freedom to 
use sophisticated scheduling techniques that approach full 
nonlinear control [81, 99]. 

One approach to improving failure tolerance is paral- 
lel redundancy : two or more control strings, each sepa- 
rately capable of satisfactory control, are implemented in 
parallel. A voting scheme is used for redundancy man- 
agement. With two identical channels, a comparator can 
determine whether or not control signals are identical; 
hence, it can delect a failure but cannot identify which 
string has failed. Using three identical channels, the con- 
trol signal with the middle value can be selected (or 
voted), assuring that a single failed channel never controls 
the plant. Parallel redundancy can protect against control- 
system component failures, but it docs not address failures 
of plant components. Analytical redundancy provides a 
capability to improve tolerance to failures of both types. 
The principal functions of analytical redundancy arefail- 
ure detection, failure identification, and control-system 
reconfiguration [47]. 

Procedural adaptation and failure-tolerance features 
will evolve outward, to become more declarative in their 
supervision and more reflexive in their implementation. 
Declarative functions are especially important for differ- 
entiating between normal and emergency control func- 
tions and sensitivities. They can work to reduce trim 
drag, to increase fatigue life, and to improve handling and 
ride qualities as functions of turbulence level, passenger 
loading, and so on. Gain-scheduling control can be 
viewed as fuzzy control, suggesting that the latter has a 
role to play in aircraft control systems [100-102]. Reflex- 
ive Junctions can be added by computational neural net- 
works that approximate nonlinear multivariate functions 
or classify failures. 


establish gain and phase margin (i.c., die corresponding Bode- 
plol frequencies) are subject to change. 
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Nonlinear Control 

Aircraft dynamics are inherently nonlinear, but aero- 
dynamic nonlinearilics and inertial coupling cllccts arc 
generally smooth enough in the principal operating re- 
gions to allow linear control design techniques to be used. 
Control actuators impose hard constraints on operation 
because their displacements and rates arc strictly limited. 
Nonlinear control laws can improve control precision and 
widen stability boundaries when flight must be conducted 
at high angles or high angular rates and when the control- 
actuator limits must be challenged. 

The general principles of nonlinear inverse control 
are straightforward [103]. Given a nonlinear system of 
the form, 

x = f(x) + G(x)u (3 

where G(x) is square ( m = n) and non-singular, the control 
law 

u = -G- 1 f(x) + G- l v ( 4 

inverts the system, since 

i = f(x) + G(x)[-G' 1 f(x) + G'M = v (5 

where v is the command input to the system. 

In general, G(x) is not square ( m * n); however, 
given an m-dimensional output vector, 

A ,r, 

y = Hx O’ 

it is possible to define a nonlinear feedback control law 
that produces output decoupling of the elements of y or 
their derivatives such that yW = v. The vector con- 
tains Lie derivatives of y, 

y (d) = = f*(x) + G*(x)u ( 7 

where d is the relative degree vector of differentiation re- 
quired to identify a direct control effect on each element 
of y. G*(x) and f*(x) are components of the Lie deriva- 
tives, and G*(x) is guaranteed to be structurally invertible 
by the condition that defines relative degree 1104]. The 
decoupling control law then takes the form 

u = -[G*(x)]'3f*(x) + [G*(x)]'^v ( R 

The control law is completed by feeding y back as appro- 
priate to achieve desired transient response and by pre-lil- 
tering v to produce the desired command response 1 105]. 
Because the full state is rarely measured and measure- 
ments can contain errors, it may be necessary to estimate 

x with an extended Kalman filter, substituting x for x in 
control computations. 


Evaluating G*(x) and f*(x) requires that a full, d- 
diffcrentiablc model of aircraft dynamics be included in 
the control system; hence the statement of the control law 
is simple, but its implementation is complex (Fig. 8). 
Smooth interpolators of the aircraft model (c.g., cubic 
splines) arc needed. Feedforward neural networks with 
sigmoidal activation functions are infinitely differentiable, 
providing a good means of representing this model on-line 
and allowing adaptation [ 1 06, 107]. Limitations to the in- 
verse control approach arc discussed in [108]. 



Figure 8. Decoupling Nonlinear-Inverse Control Law. 

REFLEXIVE SYSTEMS 

Inner-loop control is a reflexive (though not necessar- 
ily linear) function. To date, most inner loops have been 
designed as procedural control structures; computational 
neural networks may extend prior results to facilitate non- 
linear control and adaptation. Neural networks can be 
viewed as nonlinear generalizations of sensitivity, trans- 
formation, and gain matrices. Consequently, compensa- 
tion dynamics can be incorporated by following earlier 
models and comrol structures. Nonlinear proportional- 
integral and model following controllers, as well as non- 
linear estimators, can be built using computational neural 
networks. 

Computational Neural Networks 

Computational neural networks are motivated by in- 
pul-output and learning properties of biological neural 
systems, though in mathematical application the network 
becomes an abstraction that may bear little resemblance to 
its biological antecedent. Computational neural networks 
consist of nodes that simulate the neurons and weighting 
factors that simulate the synapses of a living nervous sys- 
tem. The nodes are nonlinear basis functions, and the 
weights contain knowledge of the system. Neural net- 
works are good candidates for performing a variety of re- 
flexive functions in intelligent control systems because 
they arc potentially very fast (in parallel hardware imple- 
mentation), they are intrinsically nonlinear, they can ad- 
dress problems of high dimension, and they can learn 
from experience. From the biological analogy, the neu- 
rons are modeled as switching functions that lake just two 
discrete values; however, "switching* may be softened to 
"saturation," not only to facilitate learning of the synaptic 
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weights but to admit the modeling of continuous, differ- a2 = (»)3, 03 = (*) 4 , eq. 9 is represented by the truncated 

entiable functions. power scries, 


The neural networks receiving most current attention 
are static expressions that perform one of two functions. 
The first is to approximate multivariate functions of the 
form 

y = h(x) (9 

where x and y are input and output vectors and /*(• ) is the 
(possibly unknown) relationship between them. Neural 
networks can be considered to be generalized spline func- 
tions that identify efficient input-output mappings from 
observations [109, 110]. The second application is to 
classify attributes , much like the decision trees mentioned 
earlier. (In fact, decision trees can be mapped to neural 
networks [111].) The following discussion emphasizes 
the first of these two applications. 

An N-layer feedforward neural network (FNN) repre- 
sents the function by a sequence of operations, 

rW = s ( k )[W( k - 1 )r( k -l)] = s ( k )[Tl( k )] , k = 1 to N (10 

where y = r( N ) and x = rW. w( k_1 ) is a matrix of 
weighting factors determined by the learning process, and 
s (k)[.] is an activation-function vector whose elements 
normally are identical, scalar, nonlinear functions Oj(qj) 
appearing at each node: 

s ( k )[Ti( k )] = [ai(iii( k )) ...a n (T|n( k ))] T (1 1 

One of the inputs to each layer may be a unity threshold 
element that adjusts the bias of the layer's output. Net- 
works consisting solely of linear activation functions arc 
of little interest, as they merely perform a linear transfor- 
mation f/, thus limiting eq. 9 to the form, y = Ilx. 

Figure 9 represents two simple feedforward neural 
networks. Each circle represents an arbitrary, scalar, non- 
linear function Oi(-) operating on the sum of its inputs, 
and each arrow transmits a signal from the previous node, 
multiplied by a weighting factor. A scalar network with a 
single hidden layer of four nodes and a unit threshold cl- 
ement (Fig. 9a) is clearly parallel, yet its output can be 
written as the series 

y = aoao(box + co) + aiai(bix + ci) +a2<J2(b2* + c 2) 
+a3 CT 3(b3 x + c 3) 0 2 

illustrating that parallel and serial processing may be 
equivalent. 

Consider a simple example. Various nodal activation 
functions, crj, have been used, and there is no need for 

each node to be identical. Choosing oq(*) = (•). o | = (•) , 


y = ao(bo* + co) + ai(bjx + ci) 2 + a2(b2 x + C2) 3 + 
a3(b3x + c3) 4 (13 

It is clear that network weights are redundant (i.e., that 
the (a, b, c) weighting factors are not independent). Con- 
sequently, more than one set of weights could produce the 
same functional relationship between x and y. Training 
sessions starling at different points could produce differ- 
ent sets of weights that yield identical outputs. This sim- 
ple example also indicates that the unstructured feedfor- 
ward network may not have compact support (i.e., its 
weights may have global effects) if its basis functions do 
not vanish for large magnitudes of their arguments. 



3 



a) Single-Inpul/Single-Output Network. 



b) Double-Input/S ingle-Output Network. 

Figure 9. Two Feedforward Neural Networks. 

The sigmoid is commonly used as the artificial neu- 
ron. It is a saturating function: o(r\) = 1/(1 + e _T l) for out- 
put in (0,1) or a(r\) = (1 -e 2r l)/(l + c 2 ^) = tanh r\ for 
output in (-1,1). Recent results indicate that any continu- 
ous mapping can be approximated arbitrarily closely with 
sigmoidal networks containing a single hidden layer (N = 
2) 1 1 12, 113]. Symmetric functions like the Gaussian ra- 

2 

dial basis function (o(r|) = e _T l ) have better convergence 
properties for many functions and have more compact 
support as a consequence of near-orthogonality [109, 
114]. Classical B -splines [115] could be expressed in par- 
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allel form, and it has been suggested that they be used in 
multi-layered networks [116]. Hidden layers strengthen 
the analogy to biological models, though they arc not nec- 
essary for approximating continuous functions, and they 
complicate the training process. 

In control application, neural networks perform func- 
tions analogous to gain scheduling or nonlinear control. 
Consider the simple two-input network of Fig. 9b. The 
scalar output and derivative of a single sigmoid with unit 
weights tire shown in Fig. 10. If u is a fast variable and v 
is a slow variable, choosing the proper weights on the in- 
puts and threshold can produce a gain schedule that is ap- 
proximately linear in one region and nonlinear (with an 
inflection point) in another. More complex surfaces can 
be generated by increasing the number of sigmoids. If u 
and v are both fast variables, then the sigmoid can repre- 
sent a generalization of their nonlinear interaction. 

For comparison, a typical radial basis function pro- 
duces the output shown in Fig. 11. Whereas the sigmoid 
has a preferred input axis and simple curvature, the RBF 
admits more complex curvature of the output surface, and 
its effect is more localized. The most efficient nodal acti- 
vation function depends on the general shape of the sur- 
face to be approximated. There may be cases best han- 
dled by a mix of sigmoids and RBF in the same network. 

The cerebellar model articulation controller (CM AC) 
is an alternate network formulation with somewhat differ- 
ent properties but similar promise for application in con- 
trol systems [117, 1 18]. The CM AC performs table look- 
up of a nonlinear function over a particular region of 
function space. CMAC operation can be split into two 
mappings. The first maps each input into an association 
space A . The mapping generates a selector vector a of 
dimension with c non-zero elements (usually ones) 
from overlapping receptive regions for the input. The 
second mapping, R, goes from the selector veclor a lo die 
scalar output y through the weight vector w , which is de- 
rived from training: 

y = w^a ( 1 4 

Training is inherently local, as the extent of the receptive 
regions is fixed. The CMAC has quantized output, pro- 
ducing "staircased" rather than continuous output. A re- 
cent paper proposes to smooth the output using B-splinc 
receptive regions [119]. 

The FNN and CMAC are both examples of static 
networks , that is, their outputs are essentially instanta- 
neous: given an input, the speed of output depends only 
on the speed of the computer. Dynamic networks rely on 
stable resonance of the network about an equilibrium 
condition to relate a fixed set of initial conditions to a 
steady-slate output. Bidirectional Associative Memory 
(BAM) networks [120] are nonlinear dynamical systems 
that subsume Hopfield networks [121], Adaptive-Rcso- 
nance-Theory (ART) networks [122], and Kohoncn net- 


works [123]. Like FNN, they use binary or sigmoidal 
neurons and store knowledge in the weights that connect 
them; however, the "neural circuits" take time to stabilize 
on an output. While dynamic networks may operate more 
like biological neurons, which have a refractory period 
between differing outputs, computational delay degrades 
aircraft control functions. 

Although neural networks performing function ap- 
proximation may gain little from multiple hidden layers, 
networks used for classification typically require multiple 
layers, as follows from the ability to map decision trees to 
neural networks [111]. The principal values of perform- 
ing such a mapping are that it identifies an efficient 
structure for parallel computation, and it may facilitate in- 
cremental learning and generalization. 



b) x-Dcrivative of Sigmoid. 


Figure 10. Example of Sigmoid Output with Two Inputs. 

Neural networks can be applied to failure detection 
and identification (FDI) by mapping data patterns (or 
feature vectors ) associated with failures onto detec - 
tor/identification vectors (e.g., [124-126]). To delect fail- 
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ure, the output is a scalar, and the network is trained (for 
example) with *T' corresponding to failure and "0” corre- 
sponding to no failure. The data patterns associated with 
each failure may require feature extraction , pre-process- 
ing that transforms the input time scries into a feature 
vector [124]. As an alternative, the feature vector could 
be specified as a parity vector [127], and the neural net- 
work could be used for the decision-making logic in FDI. 



a) Radial Basis Function (RBF). 



b) x-Derivative of RBF. 

Figure 11. Example of Radial Basis Function Output with 
Two Inputs. 

Reflexive Learning and Adaptation 

Training neural networks involves either supervised 
or unsupervised learning. In supervised learning , the net- 
work is furnished typical histories of inputs and outputs, 
and the training algorithm computes the weights that mini- 
mize fit error. FNN and CM AC require this type of train- 
ing, as discussed below. In unsupervised learning , the in- 
ternal dynamics are self-organizing, tuning the network to 
home on different cells of the output semantic map in re- 
sponse to differing input patterns [ 1 28 1 . 


Backpropagaiion learning algorithms for the ele- 
ments of VV(k) typically involve a gradient search (e.g., 
[129, 130]) that minimizes the mean-square output error 

I=[r,l-r(N)] T [r d -r(N)] (15 

where r<j is the desired output. For each input-output ex- 
ample presented to the network, the gradient of the error 
with respect to the weight matrix is calculated, and the 
weights arc updated by 

< ) w= vv !,id + P' (k - 1> [ d(k) ] T 06 

(3 is the learning rate, and d is a function of the error be- 
tween desired and actual outputs. For the output layer, the 
error term is 

d(N) = s'[vv(N-l) r (N-l)](r d -r(N)) (17 

where the prime indicates differentiation with respect to r. 
For interior layers, the error from the output layer is prop- 
agated from the output error using 

d(k) = s’[\v( k - 1 )r( k - 1 )] [w( k -D] T d(k-l) ( 18 

Search rate can be modified by adding momentum or 
conjugate-gradient terms to eq. 16. 

The CM AC network learning algorithm is similar to 
backpropagaiion. The weights and output arc connected 
by a simple linear operation, so a learning algorithm is 
easy to prescribe. Each weight contributing to a particular 
output value is adjusted by a fraction of the difference be- 
tween the network output and the desired output. The 
fraction is determined by the desired learning speed and 
the number of receptive regions contributing to the output. 

Learning speed and accuracy for FNN can be further 
improved using an extended Kalman filter [106, 107, 
131]. The dynamic and observation models for the filter 
are 


«k = "k-l +qk-l 

(19 

zk = h(wk, rk) + nk 

(20 


where \vk is a vector of the matrix W^'s elements, li(*) is 
an observation function, and qk and nk are noise pro- 
cesses. If the network has a scalar output, then zk is 
scalar, and the extended Kalman filter minimizes the fit 
error between the training hypcrsurface and that produced 
by the network (cq. 15). It has been found that the fit er- 
ror can be dramatically reduced by considering the gradi- 
ents of the surfaces as well [106, 107]. The observation 
vector becomes 
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with concomitant increase in the complexity of the filter. 
The relative significance given to function and derivative 
error during training can be adjusted through the mea- 
surement-error covariance matrix used in filter design. 

Recursive estimation of the weights is useful when 
smooth relationships between fit errors and the weights 
are expected, when the weight-vector dimension is not 
high, and when local minima are global. When one of 
these is not true, it may speed the compulation of weights 
to use a randomized search, at least until convergent re- 
gions are identified. Such methods as simulated anneal- 
ing or genetic algorithms can be considered (and the latter 
has philosophic appeal for intelligent systems) [132-134]. 
The first of these is motivated by statistical mechanics and 
the effects that controlled cooling has on the ground states 
of atoms (which are analogous to the network weights). 
The second models the reproduction, crossover, and mu- 
tation of biological strings (e.g., chromosomes, again 
analogous to the weights), in which only the fittest com- 
binations survive. 

Statistical search methods can go hand-in-hand with 
SRA to train robust neural networks. Following |98[, the 
randomized search could be combined with Monte Carlo 
variation of system parameters during training, numeri- 
cally minimizing the expected value of fit error rather than 
a deterministic fit error. 

We envision an aerodynamic model that spans the 
entire flight envelope of an aircraft, including post-stall 
and spinning regions. The model contains six neural net- 
works with multiple inputs and scalar outputs, three for 
force coefficients and three for moment coefficients (for 
example, the pitch moment network takes the form C m = 
g(x,u), where x represents the state and u the control). If 
input variables are not restricted to those having plausible 
aerodynamic effect, false correlations may be created in 
the network; hence, attitude Euler angles and horizontal 
position should be neglected, while physically meaningful 
terms like elevator deflection, angle of attack, pitch rate, 
Mach number, and dynamic pressure should be included 
(107], 

The aircraft spends most of its flying time within 
normal mission envelopes. Unless it is a trainer, the air- 
craft docs not enter post-stall and spinning regions; con- 
sequently, on-line network training focuses on normal 
flight and neglects extreme conditions. This implies not 
only that networks must be pre-lrained in the latter regions 
but that normal training must not destroy knowledge in 
extreme regions while improving knowledge in normal 
regions. Therefore, radial basis functions appear to be a 
better choice than sigmoid activation functions for adap- 
tive networks. 


Elements of the input vector may be strongly corre- 
lated with each other through the aircraft's equations of 
motion; hence, networks may not be able to distinguish 
between highly correlated variables (e.g., pitch rale and 
normal acceleration). This is problematical only when 
the aircraft is outside its normal flight envelope. Pre- 
training should provide inputs that are rich in frequency 
content, that span the state and control spaces, and that are 
as uncorrelaled as possible. Generalization between 
training points may provide smoothness, but it docs not 
guarantee accuracy. 

Control Systems Based on Neural Networks 

Neural networks can find application in logic for 
control, estimation, system identification, and physical 
modeling. In addition to work already referenced, addi- 
tional examples can be found in [135-140], 

Figure 12a illustrates an application in which the neu- 
ral network forms the aircraft model for a nonlinear-in- 
verse control law. The aircraft model of Fig. 9 is imple- 
mented with a neural network that is trained by a dedi- 
cated (weight) extended Kalman filter (the thick gray ar- 
row indicating training). The extended Kalman filter for 
state estimation is expanded to estimate histories of forces 
and moments as well as the usual motion variables. 

It is possible to conduct supervised learning on-line 
while not interfering with normal operation because the 
stale Kalman filter produces both the necessary inputs and 
the desired outputs for the network training algorithm. 
There is no need to provide an ideal control response for 
training, as the form of the control law is fixed. Procedu- 
ral and reflexive functions are combined in this control 
implementation, under the assumption that the direct ex- 
pression of inversion is the most efficient approach. 

Figure 12b shows a logical extension in which the in- 
verse control law is implemented by neural networks. In- 
version is an implicit goal of neural-network controllers 
[135, 136], and the formal existence of inversion net- 
works has been explored [141], Although Fig. 12b im- 
plies that the inversion networks are pre-trained and fixed, 
they, loo, can be trained with the explicit help of the net- 
work that models the system [136J. 

If a desired control output is specified (Fig. 12c), then 
the formal model of die aircraft is no longer needed. The 
control networks learn implicit knowledge of the aircraft 
model. Referring to Fig. 10 and eq. 1 and 2, control and 
estimation gains, state-transition and control-effect matri- 
ces, and measurement transformations can be imple- 
mented as static neural networks with either off-line or 
on-line learning. 

Dividing control networks into separate feedback and 
forward parts may facilitate training to achieve design 
goals. A feedback neural network has strongest effect on 
homogeneous modes of motion, while a forward neural 
network is most effective for shaping command (forced) 
response. This structure is adopted in [139], where the 
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forward and feedback networks are identified as reason 
and instinct networks . In pre-training, it is plausible that 
the feedback network would be trained with initial condi- 
tion responses first, to obtain satisfactory transient re- 
sponse. The forward network would be trained next to 
achieve desired steady states and input decoupling. A 
third training step could be the addition of command-error 
integrals while focusing on disturbance inputs and param- 
eter variations in training sets. 



a) Neural Network for Modeling and Adaptation. 


Once baselines have been achieved, it could prove 
useful to admit limited coupling between forward and 
feedback networks to enable additional nonlinear com- 
pensation. In adaptive applications, networks would be 
pre-trained with the best available models and scenarios to 
establish satisfactory baselines; on-line training would 
slowly adjust individual systems to vehicle and mission 
characteristics. 

Including the integral of command-vector error as a 
neural network input produces a proportional-integral 
structure [140], while placing the integrator beyond the 
network gives a proportional-filter structure (Fig. 10). 
The principal purpose of these structures is, as before, to 
assure good low- and high-frequency performance in a 
classical sense. Extension of neural networks to state and 
weight filters is a logical next step that is interesting in its 
own right as a means of more nearly optimal nonlinear es- 
timation. 

CREW-STRUCTURED INTELLIGENT 
AIRCRAFT CONTROL 



The declarative AUTOCREW paradigm presented 
earlier can be expanded to include procedural and reflex- 
ive functions, recognizing that control of flight is just one 
of several control functions in the aircraft. An intelligent 
control system for a civil aircraft might take the form of 
Fig. 13; functions represented by crew-member equiva- 
lents are linked to each other by a communications net- 
work and to aircraft systems via a separate network. This 
concept remains to be explored. 

CONCLUSION 


b) Neural Networks for Modeling, Adaptation, and Control. 
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c) Neural Networks for Control Alone. 

Figure 12. Adaptive Control Structures Using Neural 
Networks. 


Intelligent (light control systems can do much to im- 
prove the operating characteristics of aircraft. An exami- 
nation of cognitive and biological models for human con- 
trol of systems suggest that there is a declarative, procedu- 
ral, and reflexive hierarchy of functions. Top-level air- 
craft control functions arc analogous to conscious and 
prcconscious thoughts that arc transmitted to lower-level 
subsystems through subconscious, neural, and reflex-like 
activities. Human cognition and biology also suggest 
models for learning and adaptation, not only during opera- 
tion but between periods of activity. 

The computational analogs of the three cogni- 
tivc/biological paradigms are expert systems, stochastic 
controllers, and neural networks. Expert systems organize 
decision-making efficiently, stochastic controllers opti- 
mize estimation and control, and neural networks provide 
rapid, nonlinear, input-output functions. It appears that 
many functions at levels could be implemented as neu- 
ral networks. While this may not always be necessary or 
even desirable using sequential processors, mapping 
declarative and procedural functions as neural networks 
may prove most useful as a route to new algorithms for 
the massively parallel processors of the future. 
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